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PREFACE 


The  Complexity  Theory  CRP  (9000-506)  was  initiated  to  study  new  developments  in 
Complexity  Theory  and  report  on  any  techniques  that  may  be  applicable  to  IDA  tasking. 
The  initial  tasking  was  to  investigate  up  to  three  promising  areas  in  Complexity  Theory. 
The  first  area  involved  developing  new  separations  results  for  Krentel's  OptP  classiHcation 
[Kr87]  of  NP  optimization  problems.  The  second  area  was  to  examine  the  possible 
extension  of  Krentel's  absolute  error  bounds  for  NP  approximations  to  relative  error 
bounds,  while  the  third  area  involved  examination  of  the  EP  theory  of  parallel  algorithms 
by  Kruskal,  Randolph,  and  Snir. 

The  development  of  new  separation  results  would  allow  Krentel's  absolute  error  bounds 
on  polynomial-time  approximation  to  be  extended  to  cover  more  types  of  NP  optimization 
problems,  while  the  relative  approximation  results  represented  new  work.  The  importance 
of  any  error  bounds  was  deemed  important  in  algorithm  design.  By  providing  a  method  to 
determine  theoretical  limits  on  the  approximabili^,  algorithm  designers  could  spend  less 
time  working  on  problem  variations  not  likely  to  yield  good  approximations.  EP  theory 
was  deemed  important  since  it  addressed  parallel  efficiency  and  represented  an  alternative  to 
NC  theory  in  parallel  algorithm  design.  Of  these  three  areas,  two  were  addressed  and  one 
was  reported  on  under  this  CRP. 

The  first  subtask  was  to  seek  new  proofs  of  separation  for  certain  OptP  classes  that  were 
not  yet  known  to  be  separable,  to  apply  these  separations  to  show  absolute  approximation 
bounds  for  classes  where  they  did  not  previously  apply.  Several  simple  proofs  were 
discovered  that  showed  new  separations  between  certain  classes  based  upon  the 
assumption  that  NPg  DTIME(n‘°*  “)  and  other  weaker  results.  As  it  turned  out,  these 
proofs  were  simplifications  of  proofs  previously  discovered  by  Krentel  and  Gasarch. 
Beigel  [Be88]  recently  proved  an  exhaustive  series  of  separation  results  that  extended 
Krentel's  work.  Because  of  the  completeness  of  this  recent  work  no  further  refinement  of 
this  area  was  attempted  under  this  CRP. 

The  second  subtask  was  to  investigate  the  possibility  of  using  Query  bounds  in  the  spirit 
of  Gasarch  and  Krentel  to  prove  relative  bounds  on  approximations.  Krentel's  work 


provided  absolute  accuracy  bounds.  Relative  results  yield  more  information,  and  are 
generally  more  useful  in  practice.  Under  this  CRP  a  new  method  was  discovered  to  bound 
the  relative  error  of  approximations  for  certain  limited  classes  of  NP-Complete  optimization 

problems.  These  results  were  then  extended  to  yield  \  -approximatitm  results  for  several 

optimization  problems. 

The  third  subtask  was  to  investigate  the  recently  developed  EP  theory  that 
characterizes  parallel  algorithms  by  their  efficiency.  Time  did  not  permit  investigation  of 
this  area. 

This  paper  was  reviewed  extensively  throughout  its  life  cycle  by  members  of  the 
Institute  for  Defense  Analyses  review  team;  Dr.  Eric  Roskos,  Dr.  Cy  Ardoin,  Dr.  Michael 
Kappel,  Mr.  Steve  Edwards,  Mr.  Jim  Baldo,  and  Mr.  Terry  Mayfield. 
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1  INTRODUCTION 


This  paper  presents  a  new  method  for  determining  lower  bounds  on  the  relative  accuracy 
attainable  for  polynomial-time  approximations  to  certain  restricted  types  of  NP-Complete 
optimization  problems  herein  referred  to  as  1)  bounded  optimizations,  2)  NP  constrained 
optimizations,  and  3)  A-optimizations.  The  method  makes  use  of  query  analyses  with 
Oracle  Turing  machines,  with  the  goal  to  develop  a  consistent  approach  for  determining 
lower  bounds  for  accuracy  in  poly-time  approximations.  The  methods  presently  allow  k- 
approximation  results  to  be  shown  for  these  restricted  NP  optimization  problems  and 
allows  -approximation  results  to  be  extended  to  unrestricted  problems.  The  goal  is  to 

find  methods  that  allow  k-approximation  and  E-approximation  results  to  be  extended  to 
unrestricted  problems. 

The  ability  to  show  lower  bounds  on  error  has  ramifications  in  algorithm  design.  Since 
no  exact  polynomial-time  solutions  are  known  for  any  NP-Complete  optimization 
problems,  algorithm  designers  must  often  resort  to  polynomial-time  approximation 
algorithms  for  these  problems.  This  process  often  involves  inventing  variations  of  the 
underlying  problem  by  simplification,  substitutions,  or  other  methods  in  an  attempt  to  find 
a  problem  that  can  easily  be  approximated.  Having  theoretical  tools  to  show  lower  bounds 
on  error  for  some  of  these  problem  variations  would  allow  the  algorithm  designer  to 
quickly  discard  approaches  that  cannot  be  approximated  to  the  required  degree  of  accuracy. 

The  first  class  of  problem  studied  in  this  paper  is  the  bounded  optimization  problem. 

This  is  a  NP  optimization  problem  where  the  number  of  elements  of  the  wimess  that  may 
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participate  in  the  opdniization  is  bounded.  As  an  example,  consider  BOUNDED  CLIQUE. 
This  problem  consists  of  a  graph  G,  and  a  set  of  distinguished  vertices  U.  We  ask  how 
many  of  the  distinguished  vertices  can  be  found  in  any  one  clique  of  size  k.  Since  the 
optimal  solution  is  never  more  than  |U|,  the  optimization  is  bounded  by  how  large  U  may 
be  as  a  function  of  the  vertex  set  V.  Although  this  problem  class  is  contrived  to  aid  in  the 
query  analyses,  it  yields  some  problems  that  are  interesting  in  their  own  right. 

The  second  type  of  restricted  optimization  is  the  NP  constrained  optimization.  In  this 
problem  we  are  given  two  instances  of  an  NP-Complete  problem  I],  and  I2-  Ii  is  used  to 
constrain  the  legal  witnesses  of  I2,  and  I2  is  the  optimization  problem.  An  example  of  this 
type  of  problem  is  C  MAX  SAT.  This  problem  consists  of  two  boolean  formulas  (pj,  and 
<p2  over  variables  X.  The  goal  is  to  output  the  maximum  number  of  clauses  in  (p2  that  can 
be  simultaneously  satisfied  by  a  wimess  that  completely  satisfies  (pj. 

The  third  type  of  optimization  is  the  A-optimization.  In  this  problem  we  are  given  two 
instances  of  a  problem  Ij,  and  I2.  The  goal  is  then  to  compute  the  optimization  on  both 
instances  and  output  their  difference.  As  an  example,  consider  A-CLIQUE.  In  this 
problem  we  are  given  two  graphs  G^,  and  G2.  Let  k^  be  the  maximal  clique  in  G^,  and  k2 
be  the  maximal  clique  in  G2.  We  wish  to  know  how  much  larger  k2  is  than  kj. 

The  main  technique  utilized  in  this  paper  is  oracle  query  analysis.  It  is  used  to  establish 
the  number  of  queries  required  to  solve  a  problem  on  a  Oracle  Turing  machine.  The 
query  requirements  are  based  on  some  complexity  theoretic  assumption  such  as  P^NP,  or 
NIVR.  That  is,  the  problem  will  require  at  least  the  number  of  queries  specified  as  long  as 
the  complexity  theoretic  assumption  holds.  The  initial  query  requirements  are  established 
by  proofs  that  directly  show  the  assumption  is  violated  if  a  base  problem  can  be  solved  in 
fewer  than  the  required  number  of  queries.  Metric  reductions  are  then  used  to  propagate 


query  requirements  to  other  problems.  The  reductions  are  constructed  to  allow  the  new 
query  requirements  to  be  determined  from  the  query  requirements  of  the  source  problem. 

Once  the  query  requirements  have  been  established,  they  can  be  used  to  develop  lower 
bounds  on  the  accuracy  of  polynomial-time  approximations  assuming  the  complexity 
theoretic  assumption  holds.  In  this  paper  k-approximation  results  are  established  for 
several  restricted  problems.  The  k-approximation  results  can  be  extended  to 

"V -approximation  results  for  several  unrestricted  optimizations  problems  including 

CLIQUE,  and  MAXIMUM  INDEPENDENT  SET.  Extensions  to  k-approximation  and  e- 
approximations  have  met  with  limited  success  since  all  uiuestricted  k-approximation  results 
so  far  found  can  be  proven  by  other  means. 

The  remainder  of  the  paper  is  organized  as  follows.  Section  2  presents  the  definitions 
required  to  develop  query  requirements  and  presents  two  approximation  bound  proofs. 
Section  3  presents  query  requirements  for  a  base  problem  LEX  and  uses  them  to  show 
examples  of  query  requirement  proofs  for  problems  in  each  of  the  three  restricted 
optimization  classes.  Section  4  utilizes  these  results  to  show  approximation  bounds  for 
restricted  and  unrestricted  NP  optimization  problems. 
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2  APPROXIMATION  BOUNDS  FROM  QUERY  BOUNDS 


This  section  establishes  the  tools  required  to  show  k-approximation  bounds  for  NP 
optimization  problems  based  upon  the  number  of  queries  required  to  solve  them  on  a 
Oracle  Turing  Machine  (OTM).  Two  theorems  are  then  presented  that  use  query  bounds  to 
show  lower  bounds  on  relative  and  absolute  error.  The  first  theorem  is  a  modified  version 
of  Krentel's  theorem  for  absolute  approximation  bounds  and  allows  absolute 
approximation  bounds  to  be  expressed  in  terms  of  natural  problem  size  measures.  A  new 
theorem  is  then  introduced  that  uses  precise  query  bounds  and  the  number  of  bits  required 
to  express  the  solution  to  get  relative  approximation  bounds. 

2.1  DEFINITIONS 

In  this  section  we  introduce  the  concept  of  a  "natural"  measure  for  bounding  queries  in 
an  NPC  optimization  problem.  These  concepts  are  used  to  show  theorems  for  bounding 
absolute  and  relative  errors  of  polynomial  time  approximations.  The  natural  measures  will 
be  1)  within  a  polynomial  of  any  reasonable  encoding,  2)  natural  for  bounding  the  queries 
required  undev  the  intuitive  binary  search  using  NP  queries,  and  3)  natural  for  bounding 
the  size  of  the  solution  in  bits.  Examples  include  the  number  of  variables  for  LEX-SAT 
and  other  SAT  wimess  problems,  or  the  number  of  vertices  for  COLORING,  CLIQUE, 
and  other  vertex  counting  graph  problems,  etc.  The  concept  of  a  "natural"  query  measure 
and  notations  are  formally  introduced  below. 

DEFINITION  2.1.1  Natural  measure,  A  natural  measure,  mfj:!!— >N,  is  function  that 

relates  a  "size"  to  some  instance  of  problem  FI  in  a  natural  and  standard 
way.  For  example,  mfj(G)  =  |V|  is  a  natural  measure  for  CHROMATIC 
NUMBER({V3}))-  This  will  be  also  be  abbreviated  as  "n-measure". 


5 


NOTATION  2.12  When  dealing  with  known  problem  n-measures,  the  actual  function 
will  be  explicitly  written.  For  example,  for  graph  problems  where  the 
queries  are  measured  in  terms  of  the  size  of  the  vertex  set  we  will  use  |V| 
instead  of  mjj(G).  When  discussing  unspecified  problems  with 
unspecified  n-measures  we  will  use  the  symbol  m.  Sometimes  this  will 
also  be  written  as  m(x)  to  emphasize  that  the  n-measuie  value  is  a  function 
of  the  input  instance. 

DEFINITION  2.1.3  Query  requirement,  >N,  is  function  that  describes  the 

number  of  queries  required  to  solve  an  instance  I  of  problem  11  on 
machine.  The  query  requirements  are  relative  to  some  complexity 
theoretic  premise,  such  as  P^NP,  or  NP?*RP,  etc.  Sometimes  the  query 
requirements  for  a  specific  problem  will  be  written  with  the  problem  name 
as  the  subscript.  For  example,  Qcliq  the  query  requirement  for  the 
problem  CLIQUE. 

NOTATION  2.1.4  When  dealing  with  problems  with  known  query  requirements  as  a 
function  of  their  n-measure,  we  will  write  the  function  explicitly.  For 
example,  a  logarithmic  query  requirement  for  Vertex  Cover  would  be 
written  as  loglVj.  When  dealing  with  unspecified  query  bounds,  the 
symbol  "Q(m)"  will  be  used. 

DEFINITION  2.15  k-approximation.  Problem  11  has  a  polynomial-time  k- 
approximation  if  there  exists  a  polynomial-time  algorithm  that  guarantees 

I  ApproXj,(x)  -  optj^(x) 
optn(x) 


<  k(optjj(x)). 


If  k(z)  is  a  constant  function,  the  approximation  result  can  also  be  expressed  in  terms  of 
overapproximations  and  underapproximations.  An  overapproximation  approaches  a 
solution  from  above,  while  an  underapproximation  approaches  a  solution  from  below.  A 
k- approximation  is  one  that  can  guarantee  (Vx)  Approx(x)  <  (k+l)opt(x)  for 
overapproximations,  and  (Vx)  Approx(x)  ^  (k-l)opt(x)  for  underapproximations.  Both 
notations  will  be  used  in  this  paper. 

2.2  ERROR  BOUND  THEOREMS 

The  theorems  presented  in  this  section  establish  lower  bounds  on  the  absolute  and 
relative  accuracy  attainable  for  polynomial  time  approximations  to  NP-Complete 
optimization  problems  assuming  query  requirements  Q(/n(x)).  Since  the  query  requirement 
is  conditional  on  some  complexity  theoretic  assumption  such  as  P^NP,  NP^RP,  etc;  this 
allows  lower  bounds  to  be  established  for  various  complexity  theoretic  assumptions. 
These  theorems  are  stated  without  reference  to  this  condition  to  allow  them  to  be  more 
generally  applicable.  In  Section  3  these  theorems  will  be  applied  assuming  IVNP. 

Theorem  2.1  below  is  an  adaptation  of  Krentel’s  OptP  approximation  results  showing 
that  the  absolute  accuracy  attainable  by  a  polynomial  time  approximation  is  bounded  by 
Theorems  2.2  and  2.3  impose  a  relation  between  query  requirements  and  output 
bits  required  to  bound  the  relative  accuracy  of  polynomial-time  approximations  to  bounded 
as  well.  Corollaries  2.4  and  2.5  give  k-approximation  bounds  when  the  difference 
between  query  requirements  and  bits  required  to  express  the  solution  is  a  constant. 

THEOREM  2.2.1  If  FI  requires  Q(m)  queries  then  no  polynomial-time  approximation  can 
guarantee  |lApprox(x)-opt(x)||  ^  (NOTE:  ||x||  is  absolute 

value) 
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PROOF  If  n  requires  Q(m)  queries  then  by  definition  it  cannot  be  calculated  in 

polynomial  time  with  Q(m)-1  queries  unless  the  associated  premise  is  • 

false.  If  n  can  be  approximated  to  within  in  polynomial  time, 

then  an  OTM  algorithm  can  be  built  that  violates  the  query  requirement 

as  follows.  Assume  IT  can  be  approximated  to  within  in  • 

polynomial  time.  Run  the  polynomial  time  approximation  so  opt(x)  is 

known  to  within  2^^"^*^^^  Then  run  a  deterministic  binary  search  with 

an  NPC  oracle  to  resolve  the  last  Q(/«)-l  bits  and  solve  11  in  Q(m)-1  0 

queries.  Therefore,  any  polynomial-time  approximation  must  have 

II Approx(x)-opt(x)||  <  2Q('")‘^  infinitely  often.O 


We  now  use  the  absolute  accuracy  bound  of  Theorem  2. 1  to  show  a  new  result  for 
relative  approximation  bounds  as  well.  This  result  can  be  used  to  show  that  for  certain 
problems,  no  poly-time  approximations  can  guarantee  (Vx)[Approx(x)  ^  k»opt(x)]  below 
certain  cut-off  values  of  k.  Since  Theorem  2.1  only  gives  us  information  about  the 
absolute  value  of  the  error,  ||Approx(x)-opt(x)||,  we  need  to  specify  if  the  approximation 
approaches  the  optimum  from  above  or  below.  Most  relative  error  assessments  deal  with 
approximations  approaching  from  above,  so  Theorem  2.2  deals  with  this  case.  A  similar 
result  for  approximations  approaching  from  below  is  shown  in  Theorem  2.3. 


It  should  be  noted  that  although  these  theorems  can  bound  the  accuracy  attainable  by 
such  approximations,  they  say  nothing  about  the  existence  of  such  approximations. 
Existence  is  proven  by  exhibiting  an  algorithm. 
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THEOREM  222  If  problem  11  requries  Q(m)  queries  and  opt(x)  never  requires  more  than 
Q’(»»)  bits,  then  no  poly-time  approximation  for  IT  approaching  from 
above  can  guarantee  (Vx)Approx(x)<[  1+2*^^  )  Q(  )  jQpt(x). 

PROOF  From  Theorem  2.1  we  know  that  ||Approx(x)-opt(x)l|  =  Q(  2‘3('")-^).  In 

what  follows  |x|  is  the  length  of  x  in  bits.  Let  opt(x)  be  such  that 
(Vx)[|opt(x)|  ^  Q'(m(x))],  so  opt(x)  <  for  all  inputs  of  n- 

measure  m.  We  can  then  construct  the  ratio 

IIApproxCx)  -  opiOOII  2^  ,  -  W  -  ■  infiritelyofien 

opt(x)  2^(«)  ^ 

Since  the  numerator  is  a  lower  bound  and  the  denominator  is  an  upper 
bound,  the  inequality  holds.  Recalling  that  Approx(x)  k  opt(x)  and 
solving  to  get  Approx(x)  and  opt(x)  on  different  sides,  we  get 

Approx(x)  ^[l+2^”*^  infinitely  often. 0 

THEOREM  22.3  If  problem  IT  requries  Q(m)  queries  and  opt(x)  never  requires  more  than 
Q'(m)  bits,  then  no  poly-time  approximation  for  11  approaching  from 
below  can  guarantee  (Vx)Approx(x)  >(  1-2^”’^  *)opt(x). 

PROOF  Using  reasoning  similar  to  Theorem  2.2  and  recalling  that 
Approx(x)  ^  opt(x),  we  get 

Approx(x)  ^[l-2^”*^  *]opt(x)  infinitely  often  0 
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COROLLARY  22.4  If  the  queiy  requirements  and  optimization  constraint  for  problem  11 
differ  by  an  additive  constant  k,  then  no  poly-time  approximation  for  11 
approaching  from  above  can  guarantee  (Vx)[Approx(x)  <  k*opt(x)]  for 


k  ^ 


PROOF  By  Theorem  2.2.  0 


COROLLARY  22.5  If  the  query  requirements  and  optimization  constraint  for  problem  IT 
differ  by  an  additive  constant  k,  then  no  poly-time  approximation  for 
n  approaching  from  below  can  guarantee  (Vx)[Approx(x)  >  k*opt(x)] 
for 


k^ 


,K+1 


PROOF  By  Theorem  2.3.  0 
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3  QUERY  BOUNDS  FOR  RESTRICTED  NP 
OPTIMIZATION  PROBLEMS 


This  section  presents  precise  query  bounds  for  several  restricted  NP  optimization 
problems  in  terms  of  "natural"  problem  size  measures,.  The  query  bounds  Q(w)  describe 
the  number  of  NP  queries  required  to  solve  the  problem  on  a  P^  oracle  machine,  and  are 
constructed  assuming  P?6NP.  That  is,  no  P^**  oracle  machine  can  guarantee  a  solution 
with  fewer  than  Q(m)  queries  unless  P=NP.  Additionally,  Q(m)  will  be  described  as  a 
function  of  some  natural  measure  of  the  input  problem  size  such  as  variables  in  a  boolean 
formula,  vertices  in  a  graph,  etc.  The  proofs  follow  in  two  steps.  First,  query 
requirements  are  established  for  a  base  problem.  Second,  standard  metric  reductions  are 
then  used  to  propagate  the  query  bounds  to  other  problems. 

The  query  bound  proofs  for  the  base  problem  make  use  of  the  work  of  Krentel  [Kr87], 
and  improvements  made  by  Beigel  [Be88],  for  establishing  query  bounds  for  the  base 
problem  C^-LEX(i.e)iog  n-  The  query  requirements  for  the  base  problem  C^-LEX(i.E)iog  „ 

is  established  in  two  steps.  First,  the  problem  is  shown  to  be  NP-Hard  by  showing  a 
reduction  to  the  NP  version  C^-SAT.  C^-LEXq.gjiog  „  is  then  shown  to  require  (l-e)log  n 

queries  unless  P=NP. 

Once  the  base  problem  is  established,  metric  reductions  are  exhibited  to  establish  query 
requirements  for  several  restricted  NP  optimization  problems.  Reductions  are  based  partly 
on  the  OptP  proofs  of  Gasarch  and  Pearlman  [GP88],  and  appropriate  modifications  of 
NP  proofs  in  the  literature.  The  query  bound  proofs  work  by  carefully  examining 
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reductions  that  preserve  the  number  of  queries  required  for  solution  on  a  oracle 
machine.  These  examinations  allow  the  optimization  constraints  to  be  carefully  measured 
in  terms  of  the  natural  problem  metrics.  Care  is  also  taken  to  ensure  that  the  optimization 
constraints  match  the  query  requirements  so  that  the  theorems  of  Section  2  can  be  used  to 
obtain  bounds  on  relative  error.  Since  most  of  these  reductions  are  adaptations  of  well- 
known  reductions  in  the  literature,  only  a  brief  sketch  is  presented  for  several  reductions. 

3.1  NAMING  CONVENTION 

The  naming  convention  for  problems  in  this  section  will  be  to  place  the  optimization 
bound  as  a  subscript  to  the  problem.  Thus  CLIQUE^^uj  is  the  CLIQUE  problem  with  an 

optimization  bound  of  f(n). 

3.2  PROVING  QUERY  REQUIREMENTS  FOR  C^.LEX(,.e),og  „ 

C  -LEX(i.g)iog  „  is  a  variation  of  the  FP  problem  LEX^(n)  introduced  by  Krentel.  The 

only  additional  restriction  is  that  the  number  of  clauses  is  at  most  IXj*^  where  |X1  is  the 
number  of  variables.  C  -LEX^^^j  can  be  proven  to  require  /(n)  queries  for  certain  f(n)  by 

the  methods  of  Krentel  and  Beigel.  The  additional  restriction  on  the  number  of  clauses  is 
required  in  the  reduction  to  VAR  SAT.  In  this  paper  we  will  use  C^-LEX(i.g)iog  ^  and 
prove  it  requires  (l-e)log  n  queries  by  Beigel’s  approach.  A  formal  definition  of  LEXg^j,j 
and  C^-LEXg(j,j  follows. 

DEFINITION  3.2.1  LEXg^^j  Let  <p  be  a  boolean  formula  in  CNF  format  with  an 
ordered  set  of  distinguished  variables  X  =  {uo,...,u^(n)}  where  n  is  the 
total  number  of  variables,  and  let  (O  be  a  wimess  that  satisfies  <p.  Output 
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the  maximum  value  of  /(©)  over  any  wimess  O),  where  /(©)  is  defined 


as. 


/ 


/(©)  =  max 


true  in  cd 
x.€X 


) 


DEFINITION  3.22  -LEX^^^j,  Let  <p  be  a  boolean  formula  in  CNF  format  with 

with  at  most  |V|^  clauses  where  |V|  =n  is  the  number  of  variables  in  <p, 
and  a  set  X  =  {uo,...,uy(n)}  of  distinguished  variables.  Let  ©  be  a 
witness  that  satisfies  <p.  Output  the  maximum  of  /(©)  over  any  witness 
©,  where  /(©)  is  deHned  as  above. 


The  proof  of  query  requirements  proceeds  by  first  showing  that  a  formula  in  C’^-format  is 
NP-Complete  for  any  k>l.  It  then  follows  that  C2-LEX(i^)tog  „  wiU  be  an  NP-Complete 

optimization  problem.  Beigel's  proof  will  be  restated  with  the  appropriate  modifications  to 
prove  that  C2-LEX(i.g)iog  ^  requires  (l-e)log  n  queries  for  e>0  unless  P=NP. 

3.2.1  C*‘-SAT  is  NP-Complete 

C  -SAT  is  a  variation  of  SAT  where  the  number  of  clauses  is  at  most  IXI*^  and  |X|  is  the 
number  of  variables.  C*^-SAT  is  easily  seen  to  be  in  NP  by  guessing  the  satisfying 
assignment  C  -SAT  can  also  be  shown  to  be  NP-Complete  for  all  k>l  by  examination  of 
Cooke’s  proof  for  SATe  NP-Complete. 

THEOREMS .2. 1. 1  SAT  C'^-SAT 

PROOF  Cooke’s  proof  produces  a  boolean  formula  in  C^-SAT  for  some  p  since 

the  number  of  clauses  is  polynomial  in  the  input  tape.  Therefore  we 


only  need  to  show  that  Vk  <  p,  p  >  1,  C^’-SAT  ^  C*^-SAT.  Let  (p  be  an 
instance  of  C^-SAT  with  variables  X,  |X|  =  n,  and  |(p|  =nP.  Create  a 
new  formula  v  padding  9  with  a  single  disjunction  of  variables 
{uj,U2,  The  number  of  variables  will  be  adjusted  to  bring  C*^- 

format  into  C^-format 

We  get  \|r  =  <pa(ui,  U2,...,Uj).  We  now  want  Ivl  <  where  N=j+n  is 

the  number  of  variables  in  y.  Since  (>|/1  =  |<p|  +1  =  n^+l,  we  want  n^+l 

p 

^  (n+j)*^.  Solving  for  j  we  get  j  ^Vn**  +  1  -  n .  So  let  j  =  n*^.  This  is 

a  polynomial  reduction  since  we  are  adding  only  a  polynomial  number 
of  variables.O 

3.2.2  C^*LEX(|.e)iog  b  Requires  (l-e)log  n  Queries 

From  Section  3.1  we  saw  that  C^-LEX(i.e)]ogB  is  an  NP-Complete  optimization  problem 
since  C^-SAT  is  NP-Complete.  We  now  show  that  C^-LEX(i.e)jog  ^  requires  (l-e)log  n 

queries.  The  following  sketch  is  a  modification  of  the  proof  that  appears  in  Krentel  [Kr87] 
and  improved  by  Beigel  [Be88].  It  shows  that  if  (<p,K)e  C^-LEX(i.e)iog  „  can  be  solved 
with  less  than  (l-e)log  n  queries,  then  tpeSAT  can  be  solved  in  polynomial  time  and 
P=NP. 

The  proof  works  by  assuming  an  OTM  Mg  exists  to  solve  (9,K)e  C^-LEX(i.E)iog  n  in 
((l-e)log  n)  -  1  queries.  The  machine  Mg  is  then  used  to  build  a  new  algorithm  to  solve 
(pe  SAT  in  polynomial  time.  The  new  algorithm  works  by  simulating  Mg  on  variations  of 
q>  until  enough  information  is  extracted  to  construct  a  witness  to  9.  The  wimess  is  then 
tested  to  see  if  9e  SAT.  Beigel  shows  that  the  information  extracting  rounds  can  be 
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performed  in  polynomial  time  so  SATeP.  The  only  modification  to  the  proof  is  to  ensure 
that  it  will  work  for  formulas  in  C?  format  This  is  established  by  the  following  lemma. 

LEMMA  32.2.1  Let  <p  be  a  boolean  formula  in  C^-format  with  n  variables.  Let  <p'=<pA\|/, 
be  a  new  formula  formed  by  augmenting  the  original  formula  <p  by  y 
where  \|r  has  alog(n)2“*°*^“^  clauses  and  alog(n)  new  variables.  If  a<l 
then  9*  is  also  in  format 

PROOF  Since  <p  is  in  format, .we  know  that  I9I  <  ixf .  Let  1X1  =  n.  We  will 

add  new  variables  so  |X'|  =  n+alog(n).  The  C^-format  then  requires 
l<p'l  <  IX'l^  =  n^  +  2nalog  n  +  (alog  n)^. 

To  get  <p'  we  are  adding  otlog(n)2“*°*^“^  new  clauses  so  we  get 
alog  0  2  n 

l<P'l  =  l<Pl  +  (a  log  n)2  <  n  +  an“log  n. 

So  for  <p'  to  be  in  C^-format  we  must  have 
n  +  an“log  n  ^  n  +  2nalog  n  +  (a  log  n) 
which  surely  holds  for  a  ^  1.  0 

We  now  present  a  sketch  of  Beigel's  proof.  The  only  modification  is  to  ensure  that 
Beigel's  proof  can  handle  the  formatting  restriction. 

THEOREMS. 2.22  C2-LEX(i.g)iog  ^  requires  (l-e)log  n  queries  unless  P=NP. 

PROOF  Assume  that  Mg  is  an  OTM  that  solves  an  instance  of  C2-LEX(i^)iog  ^ 

in  ((l-e)log  n)  -  1  queries,  as  long  as  the  input  formula  is  in  format. 
Simulate  the  action  of  Mg  on  9  for  all  queries.  We  can  do  this  since 
there  are  no  more  than  0(log(n))  queries.  Call  the  queries  variables  yj. 
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y2»***«y(i-e)iog  n-i*  We  now  have  the  lowest  (l-e)log  n  variables, 
{xi,X2,  •••»X(i.e)iog  n}»  of  ^  satisfying  assignment  as  a  function  of 
{yi*y2»  "»y(i-e)iogn-i}*  Write  the  functional  dependency  between  the  x 
and  y  variables  as  a  truth  table  with  columns  and  (l-e)log  n 

rows.  The  truth  table  can  now  be  written  as  a  boolean  formula  \|f  in 
CNF  format  with  ((l-e)log  <  nlog(n)  clauses. 

We  now  make  a  new  formula  <p'  =  <pA\jr,  with  variables  {yi,...,y(i. 
E)iogn-i*  Xi,...,Xn}.  Note  that  by  lemma  3.2.2. 1  above  9'  is  guaranteed 
to  be  in  C^-format.  Rename  the  variables;  {yj,. .  .,y(i.E)iog  n-i.  ^(i-E)iog 
n+i»'*">^n»  ^i’*"»^(i-c)iog  n^*  The  variables  {xj,...»x^j.gj]Qg  at  the 
end  of  the  list  are  now  the  dependent  variables,  and  all  others  are  the 
independent  variables.  The  new  formula  is  now  processed  repeatedly 
using  the  same  simulation  of  Mg  until  all  of  the  xj  variables  are  known 
in  terms  of  y  variables.  We  now  try  all  assignments  to  the  y  variables  to 
get  the  values  of  the  x  variables.  If  any  of  them  is  a  satisfying 
assignment  then  9  is  satisfyable,  otherwise  it  is  not. 

Krentel  and  Beigel  show  that  the  above  process  can  be  accomplished 
in  polynomial  time  if  A/g  exists.  Thus  if  we  can  solve  C^-LEX(i.£)iog  ^ 
in  fewer  than  (l-e)log  n  queries  then  P=NP.O 

3.3  QUERY  REQUIREMENTS  FOR  BOUNDED  OPTIMIZATIONS 

This  section  establishes  query  bounds  for  bounded  optimizations.  These  are  problems  in 
which  the  number  of  elements  in  the  problem  that  are  allowed  to  participate  in  the 
optimization  is  bounded  as  some  function  of  the  input  size.  The  problems  in  this  section 


are  proven  to  have  specific  query  bounds  Q(m)  unless  P=NP.  That  is,  no  oracle 
machine  can  guarantee  a  solution  with  fewer  than  Q(m)  queries  unless  P=NP. 

All  problems  are  reduced  to  a  base  problem  called  C^-VAR  SAT(n/2)(i-c)-  C^-VAR 
SAT(f^/2)(i-E).  is  an  instance  of  SAT  where  a  set  of  of  the  variables  are  "marked", 

and  the  number  of  clauses  is  at  most  quadratic  in  the  number  of  variables.  The  problem  is 
to  output  the  maximum  number  of  marked  variables  that  are  true  in  any  satisfying 
assignment.  C^-VAR  SAT(j^/2)(1-e)  is  proved  to  require  (l-E)log(N/2)  queries  unless  P=NP 
by  reduction  to  C^-LEX(i.£)jog  ^  from  Section  2.  Several  other  problems  are  then  reduced 
to  this  base  problem  to  obtain  query  bounds. 

3.3.1  Bounded  Optimization  Problems 

A  bounded  optimization  problem  is  stated  in  three  parts;  Jt,  f,  and  M.  The  first  part,  7t, 
is  an  NP  problem  definition  including  the  requirements  for  a  wimess,  O).  The  second  part 
is  the  optimization  function  /(©)  over  the  witnesses.  The  third  part,  M,  is  the 
optimization  constraint  and  bounds  the  maximum  value  that  the  optimized  problem  may 
take  as  a  function  of  the  input  problem  size.  This  constraint  may  restrict  the  number  of 
elements  in  a  problem  instance  that  can  participate  in  the  optimization. 

As  an  example  consider  a  whimsical  version  of  the  Travelling  Salesman  Problem.  This 
problem  is  constructed  by  requiring  the  TSP  route  to  be  a  Hamiltonian  cycle  that  maximizes 
the  weights  from  some  preferred  edge  set.  That  is,  we  make  a  TSP  route  between  all 
cities,  but  wish  the  route  to  use  certain  preferred  city  to  city  flights  when  possible.  This 
could  be  called  the  Frequent  Flyer  TSP  since  it  seeks  to  maximize  mileage  on  certain 
selected  legs  of  the  excursion,  presumably  those  that  have  been  designated  as  "bonus 
flights"  by  the  airline.  Let  the  preferred  edge  set  be  P,  this  problem  is  then  expressed  as 


max, 


^FFTSP- 


^FFTSP- 


]^w(ej)  <  where  |V|  is  the  total  number  of  vertices. 


Notice  that  in  Mpp  j^p  above,  the  bound  limits  the  number  of  edges  that  participate  in  the 
optimization  since  the  sum  of  preferred  edge  weights  must  be  less  than  the  total  number  of 
edges  in  any  Hamiltonian  cycle  through  the  graph.  It  is  also  a  bound  on  the  magnitude  of 
any  optimal  solution. 

3.3.2  Base  Problem:  C^-VAR  SAT(N/2)(i*e) 

The  base  problem  for  all  CNPO  reductions  needs  to  be  able  to  "count"  constraints  that 
are  violated  This  is  accomplished  in  a  variation  of  SAT  by  counting  true  variables  from  a 
prespecified  set  of  "marked"  variables.  As  with  all  NPCO  problems,  the  size  of  the 
"marked"  set  is  a  function  of  the  problem  size.  A  formal  definition  of  C^-VAR  SAT(n/2)(1- 
e)  follows. 

DEFINITION  3.32.1  C^-VAR  SAT(N/2)(i-e) 


Tt^-vAR  SAT*  ^  boolean  formula  9  in  CNF  format  with  at  most  |Vp  clauses  where 
|V|  is  the  number  of  variables  in  9.  X  =  {uq,  u^(n)}  is  an  ordered  set  of 
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distinguished  variables  where  n  is  the  total  number  of  variables.  A  witness 
CO  is  a  satisfying  assignment  to  9. 


/(0))c.var  sat-  ^  1 constant  function  outputting  1. 

X.  true  in  <0 

‘  XjCX 

Mc-var  sat-  ^  (N/2)^'‘®^  where  N  is  the  total  number  of  variables. 

We  can  now  show  that  C^-VAR  SAT(n/2)(*-^)  requires  (l-e)log(N/2)  queries  unless 
P=NP  by  a  reduction  to  C2-LEX(i^)iog  j,.  Notice  that  in  the  LEX  problem,  the  subscript 

refers  to  the  length  of  the  output  bit  vector  in  terms  of  the  number  of  variables  in  the  LEX 
instance.  In  the  VAR  SAT  problem,  the  subscript  is  the  size  of  the  "marked"  variable  set 
in  terms  of  the  number  of  variables  in  the  VAR  SAT  instance.  The  order  difference  in  these 
functions  is  due  to  the  fact  that  the  VAR  SAT  instance  only  counts  elements  from  the  set, 
while  the  LEX  instance  effectively  weights  the  elements  by  2‘  before  counting  them. 

3. 3. 2.1  Construction 

The  prcx)f  proceeds  by  exhibiting  a  construction  of  a  C^-VAR  SAT(N/2)(i-e)  instance  9 
from  an  instance  <p  of  C^-LEX(i^)iog  The  construction  will  be  a  collection  of  boolean 
formulas  over  new  variables  from  the  sets  Z,  U,  C,  and  the  original  LEX  instance.  The  Z 
variables  are  used  to  simulate  the  T  weighted  X  variables  from  LEX.  The  XJ  and  M 
variables  are  unmarked  and  marked  dummy  variables  that  are  included  only  to  make  the 
mathematical  analysis  easier.  The  Z  variables  will  be  used  to  construct  the  formula  o, 
which  ensures  that  a  subset  of  Z  will  all  be  true  for  each  distinguished  variable  Xj  from 
LEX.  The  dummy  variables  will  be  collected  into  a  single  clause  5. 
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Using  the  procedure  below  a  new  formula  v  is  created  consisting  of  a  concatenation  of 
the  formulas  9,  o,  and  5.  After  the  construction  is  complete,  the  new  VAR  SAT  instance 
is  shown  to  be  in  C^-format  if  the  original  LEX  instance  was.  The  optimal  solution  to  LEX 
is  then  shown  to  be  obtinable  from  the  optimal  solution  to  VAR  SAT,  completing  the 
reduction. 


1)  Construction  of  o.  For  each  of  the  distinguished  variables  {x2,.  •  •,X(i.e)iog  n)  in 
<p  create  a  clause  using  i  new  variables  Z  =  {Zj^j,...,2j  j}  where  i  is  the  subscript  of 
the  distinguished  variable  Xj.  The  clauses  will  have  the  form 

Ci=[(Xj  V  ~Zj^j)a(X.  V  ~Z.^)a...A(Xj  V  ~Z  i.,)]A[(~X.  V  ~Zj^j)a...a(~Xj  V  -z 

i,2  i.2 

The  new  formula  o  will  then  be 


cr=: 


(l-e)log  n 


Ac. 


2)  Construction  of  5.  5  consists  of  a  single  disjunction  of  all  of  the  dummy 
variables;  M={mi,m2},  and  U  =  {  Ui„..,UB.n(i-E)}-  The  new  formula  will  then  be 
5  =  (mi  V  m2  V  Ui  V  ...  V  Un-nd-e))- 

3)  Let  9  =  <p  A  a  A  5,  where  <p  is  the  original  instance  of  C^-LEX(i.e)iog  Mark 
only  the  variables  in  ZuMu{xi}.0 

We  now  need  to  show  that  9  is  indeed  in  C^  format,  assuming  that  9  was. 

CLAIM  3. 3. 1.1.1  The  instance  \|r  of  C^-VAR  SAT(N/2)(i-e)  is  in  C^-format  if  the  input 

instance  9  is  in  C^-format. 
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PROOF 


We  first  need  to  count  up  the  total  number  of  clauses  and  variables  in  v, 
so  we  first  calculate  \L\.  For  each  xj  we  create  2*'^  new  z  variables 
which  gives  us  an  expression  for  |Z|  as 


(l-e)log  n 

U2 


(l-e)logn-l 

j-i 


-  2  new  z  variables 


The  total  number  of  variables  can  now  be  calculated  as 


|X'|  =  |Z|+|U|+|M1+IX|  =  2|X|  =  2n 

For  each  x^  we  create  2*  new  clauses.which  gives  us  an  expression  for 
the  total  number  of  new  clauses  in  a  as 

(l-e)log  n 

iv:2 


Since  |5|  =  1  we  have  the  total  number  of  clauses  as 

The  formatting  restriction  on  9  ensures  that  |<p|  ^  n^,  and  the 
formatting  restriction  on  y  requires  that  |\jr|  <  |X’|^  =  4n^.  Recalling  that 
(1-e)  <  1  we  get 

lYl  <  n^  +  -  3  <  2n2  <  =  IX'l 

So  \|/  is  in  format.O 
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From  the  analysis  above  we  can  also  calculate  the  number  of  maiked  variables  in  terms 
of  the  total  number  of  variables  in  \|f.  Let  X'  be  the  complete  variable  set,  and  jX'l  =  N.  ^ 

The  number  of  "marked"  variables  is  then  (N/2/*’^^  This  expression  is  placed  in  the 
subscript  of  C^- VAR  S  AT(n/2)(1-®)-  The  only  task  remaining  is  to  show  that  the  solution  to 
C2-LEX(i^)iog  n  can  be  obtained  from  the  solution  to  C^-VAR  SAT(n/2)(^‘®)  “  polynomial  ® 

time. 

CLAIM  3.3.1.12  The  optimal  solution  to  C^-LEX(j.E)iog  „  can  be  obtained  from  the  # 

optimal  solution  to  C^-VAR  SAT(n/2)(^-^)  opt|.F.Y=  oPWarsat  ‘ 


PROOF  In  y,  the  only  variables  marked  will  be  in  ZuMu{xi}.  The 

construction  of  a  requires  all  of  the  zj  variables  to  be  true  iff  the 
corresponding  Xj  is  true.  The  number  of  Zj  variables  in  y  for  each  Xj 
variable  in  <p  is  the  exact  weight  of  Xj  in  LEX,  so  the  number  of  Zj  and 
Xj  variables  true  will  be  exactly  the  ouq)ut  from  LEX.  The  construction 
of  5  requires  that  all  variables  in  M  be  true  in  the  optimal  solution,  so 
they  are  subtracted  off.  0 


3. 3. 2. 2  C^-VAR  SAT(N/2)(i-e)  Requires  (l-e)log(N/2)  Queries 
THEOREM3.3.1.2.1  C^-VAR  SAT(n/2)<*-®)  requires  (l-E)log(N/2)  queries  unless  P=NP. 


PROOF  From  the  reduction  we  see  that  the  number  of  variables  in  y  is  1X'|  =  N 

=  2n,  where  n  is  the  number  of  variables  in  <p,  the  instance  of  C^-  • 

LEX(i.g)iog  If  we  can  solve  y  in  less  than  (l-e)log(N/2)  queries  then 
we  could  solve  9  in  less  than  (l-e)log  n  queries  by  converting  it  to  y 
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using  the  reduction  procedure.  This  violates  Theorem  3.2.22  so  C^- 
VAR  SAT(N/2)(i-e)  requires  (l-e)log(N/2)  queries  unless  P=NP.O 


3.3.3  Weighted  Var  SAT:  WV  SAT 

w 


WV  sat  is  a  generalization  of  C^-VAR  SAT  where  the  distinguished  variables  may  cany 
t  weights  ^  1.  The  only  restriction  is  that  the  total  sum  of  the  weights  not  exceed  (N/2)^^*^\ 

DEFINITION  332.1  WV  SAT(N/2)(i-e) 

’^WSAT-  ^  boolean  formula  <p  in  CNF  format  with  at  most  |Vr  clauses  where  |V|  is 
the  number  of  variables  in  <p.  X  =  {xq,  is  a  set  of  distinguished 

variables  with  weights  w(Xi)  and  where  N  is  the  total  number  of  variables. 
^  A  wimess  0)  is  a  satisfying  assignment  to  (p. 


/(©)wv  SAT=  max^  ^  w(x.) 

X.  tnie  in  u 


► 


I 


► 


W(Xj)  <-y 
x.cX 

C^-VAR  SAT  is  a  restriction  of  WV  SAT  where  the  formula  is  required  to  be  in  C^- 
format  and  all  weights  for  the  distinguished  variables  are  1.  It  then  follows  that  WV  SAT 
requires  (l-e)log(N/2)  queries  unless  P=NP. 
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3.3.4  Weighted  Vertex  Cover:  WVC  u 


WVC  is  a  weakening  of  the  standard  minimal  vertex  cover  problem  MIN  VC.  In  WVC 
we  are  given  a  graph  G  and  permitted  to  weight  some  subset  of  the  vertices.  We  are 
seeking  the  minimum  weighted  vertex  cover  for  G.  That  is  we  seek  a  vertex  cover  ofr  G 
that  includes  the  minimum  total  weight  from  the  weighted  elements.  It  is  a  weakening  of 
MIN  VC  since  the  total  weight  of  all  distinguished  vertices  is  not  sufficient  to  allow  the 
MIN  VC  problem  to  be  calculated  by  weighting  all  vertices  by  1. 


‘■WVC- 


A  graph  G=(V^),  a  set  of  distinguished  vertices  U  with  weighting 
function  w  :  U  N,  and  maximum  vertex  cover  size  k.  A  wimess  co  is  a 
vertex  cover  on  G  that  is  of  size  ^  k. 


/(w)wvc’ 


mm. 


UjG  (0 

U.€U 


l-£ 


M 


WVC- 


,w(up^ 
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u.eU 


The  reduction  follows  in  two  stages.  The  first  is  a  reduction  from  C?-VAR  SAT  to  a 
restricted  version  of  WVC  where  all  weights  on  distinguished  vertices  are  1.  The  query 
requirement  from  this  restriction  then  follows. 
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The  reduction  from  C?-VAR  SAT  follows  the  standard  reduction  to  3-SAT  in  [GJ79] 
with  two  moditications.  The  first  is  that  the  vertices  in  each  clause  cluster  are  allowed  to 
number  up  to  |X|,  one  for  each  of  |X|  possible  vars  in  the  clause.  So  the  cluster  of  vertices 
for  each  clause  is  then  a  complete  graph  on  |C|  vertices  where  |C|  is  the  number  of  variables 
in  the  clause.  The  second  modification  is  that  the  marked  variables  from  C^-VAR  SAT 
now  allow  a  var  vertex  to  be  similarly  marked.  The  construction  follows. 

CONSTRUCTION 

1)  For  each  variable  x-  in  <p  create  two  vertices  v-  and  ~v..  Place  an  edge  between 
them. 

2)  For  each  clause  Cj  in  9  create  a  clique  of  size  |Cj|. 

3)  For  every  variable  in  clause  Cj  cormect  an  edge  to  the  appropriate  Vj  and  -’Vj.vertex 
dependening  upon  which  form  of  the  literal  appears  in  Cj. 

4)  For  each  distinguished  variable  x^  in  9  mark  the  appropriate  ~V|  vertex. 


5)  Let  k  =  |X|  +  ^|c.|  -  I9I  where  |X|  is  the  number  of  variables  in  9  and  I9I  is  the 
number  of  clauses. 

In  the  construction  above,  the  edges  radiating  out  from  the  clause  cluster  are  cormected  to 
other  variable  representing  vertices.  These  are  referred  to  as  spokes.  The  edges  between 
elements  of  a  single  cluster  are  referred  to  as  internal  edges.  The  edges  between  (vj  ,~Vj) 

pairs  are  also  internal  edges. 


Since  the  marked  variables  are  iiuqrped  directly  to  marked  vertices  it  is  easy  to  see  that  the 
maximum  number  of  marked  variables  in  a  wimess  to  (p  will  be  |U|  -  the  minimum  number 
of  marked  vertices  in  a  witness  to  G.  We  now  only  need  to  show  that  the  construction 
ensures  that  the  associated  formula  <p  is  satisfied  iff  there  is  a  vertex  cover  of  the  specified 
size.  This  is  established  in  the  following  lemma. 

LEMMA33.2.1  In  the  above  construction.there  is  a  vertex  cover  of  size  k  9  is 
sadsfiable. 

PROOF<f=  Assume  9  is  satisfiable.  For  every  variable  that  is  true  place  the  v  vertex 
into  the  cover,  and  for  every  variable  that  is  false  place  the  ~v  vertex 
into  the  cover.  This  uses  a  total  of  |X|  vertices.  Since  9  is  satisfied  by 

the  assignment  each  clause  has  at  least  one  variable  that  is  true  in  that 
clause.  Let  Uj  be  a  literal  that  supports  each  clause  c-.  For  each  clause 
cluster  Cj,  accept  all  vertices  except  Uj  into  the  vertex  cover.  Each 
clause  cluster  will  then  have  all  internal  edges  covered  and  the  cover  is 
now  of  size  k. 

Now,  for  each  clause  cluster  c^  all  spokes  are  covered  except  those 
connected  to  the  supporting  literal  u^.  But  since  the  supporting  literals 
are  true  by  defmition  they  will  also  be  in  the  cover  and  the  remaining 
spokes  are  covered. 

PROOF All  internal  edges  in  each  Vj  ,~Vj.pair  must  be  covered  so  any  vertex 

cover  must  contain  at  least  one  vertex  in  each  pair.  Thus  there  are  at 
least  |X|  (Vj  ,~v-)  vertices  in  the  cover.  Also,  each  clause  cluster  must 

have  all  internal  edges  covered.  It  can  be  proven  by  induction  that  a  k- 
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clique  has  a  minimum  vertex  cover  of  k-1  so  at  least  |Cj|  -1  vertices  from 
each  cluster  must  be  included.  Thus  there  are  at  least  k  vertices  in  the 
vertex  cover  of  G.  Since  k  is  the  size  limit  there  are  exactly  k  vertices  in 
the  vertex  cover  and  they  are  distributed  as  above. 

Since  each  cluster  has  |Cj|  -1  vertices,  each  cluster  has  exactly  one  spoke 

it  cannot  cover.  But  since  the  vertex  cover  exists  this  spoke  must  have 
been  picked  up  by  the  (Vj  ,~Vj)  vertex  at  the  other  end  of  the  spoke.  Let 
these  (Vj  ,~Vj)  vertices  be  the  true  variables.  Since  exactly  one  of  each 
(Vi,~Vi)  pair  is  in  the  cover,  they  form  a  proper  boolean  assignment. 
Thus  there  is  a  boolean  assignment  where  all  clauses  have  at  least  one 
supporting  true  literal. 

•j  ^ 

In  the  construction  the  total  number  of  vertices  N  will  satisfy  N  <  2n+n  ^  2n  ,  where  n 
is  the  number  of  vars  in  C^-VAR  SAT.  The  total  number  of  marked  vertices  is  t(n/2)^*'®\ 
We  must  rescale  the  optimization  constraint  and  query  requirement  from  (n/2)^'''^  to 

l-e 

flLV 

[  IdJ  where  N  is  the  total  number  of  vertices  in  WVC. 

3.3.5  Weighted  Max  Independent  Set;  WMIS 

(i^r 

WMIS  is  a  weakening  of  MAX  INDEPENDENT  SET  because  the  optimization 
constraint  is  insufficient  to  allow  all  vertices  to  be  counted  on  an  independent  set.  The 
problem  consists  of,  a  graph  Gs:(V,E),  a  minimum  independent  set  size  m,  and  a  set  of 
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preferred  vertices  U.  It  asks  for  a  maximally  weighted  subset  of  U  that  lies  on  an 
independent  set  over  V.  A  formal  definition  follows. 

A  graph  G=(V,E),  a  set  of  distinguished  vertices  U  with  weighting 
function  w  :  U  — >  N,  and  a  minimum  independent  set  size  m.  A  witness  co 
is  an  independent  set  on  G  of  size  ^  m. 


/(0))wMis- 


U  .E  <0 
1 

U.€U 


1-E 

Mwmis- 

u.eU 

The  reduction  follows  simply  from  WVC  For  any  graph  G  let  W  c  V  be  a  vertex  cover 
of  G.  Then  V-W  will  be  an  independent  set  of  G.  If  W  is  the  vertex  cover  that  minimizes 
WVC  then  V-W  will  be  the  independent  set  that  maximizes  WMIS  for  the  same  weighting 
function  and  set  of  distinguished  vertices  U.  The  minimum  independent  set  size  is  set  as  m 
=  IVl  -  k,  where  k  is  the  maximum  vertex  cover  size  in  WVC,  and  |V|  is  the  total  number 
of  vertices  in  G. 


JV| 

16 


3.3.6  Weighted  Clique:  WCLIQ  ^ 

(^r 

WCLIQ  is  a  weakened  generalization  of  the  MAX  CLIQ  problem  since  the  optimization 
constraint  is  insufficient  to  allow  the  maximum  clique  to  be  found  by  weighting  all  vertices 
by  1.  It  asks  for  the  maximally  weighted  subset  of  the  distinguished  vertices  that  lie  on  any 
clique  in  the  graph  G. 

^CUQ*  ^  graph  G=(V3),  a  set  of  distinguished  vertices  U  with  weighting 
function  w  :  U  — >  N,  and  a  minimum  clique  size  m.  A  wimess  (D  is  a 
clique  on  G  of  size  m  or  greater. 


/(®)wcliq' 


U.€  00 
u.cU 

I 


M, 


WCLIQ* 


1-e 


u,eU 


|V| 


The  reduction  follows  exactly  as  for  WVC  using  the  complement  of  the  graph  G*^. 
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3.3.7  FREQUENT  FLYER:  FF 

(^r 

The  FF  problem  consists  of  a  weighted  directed  graph  and  subset  P  of  the  edges  that  are 
designated  as  preferred.  The  problem  is  to  find  the  directed  Hamiltonian  cycle  through  the 
graph  that  includes  the  maximum  weight  from  the  set  of  preferred  vertices.  The  total  path 
length  is  of  no  concern.  This  problem  is  actually  a  Hamiltonian  cycle  problem,  instead  of 
a  more  complicated  Travelling  Salesperson  problem  since  the  total  path  length  is  not 
bounded.  A  formal  definition  follows. 

Ttpp:  A  weighted  graph  G=(V3).  and  a  set  of  preferred  edges  P.  A  wimess  © 

is  a  Hamiltionian  cycle  through  G. 

CjC  a> 

C.6P 

Mpp: 

e.eP 
1 

The  reduction  follows  in  two  parts.  The  first  part  is 
restricted  FF  problem  where  all  preferred  edges  get 
restricted  problem  then  follows. 


a  reduction  from  C^-VAR  SAT  to  a 
weight  1.  The  reduction  from  the 
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The  reduction  from  C^-VAR  SAT  follows  the  construction  for  reduction  from  3-SAT  in 
[AHU79]  with  the  obvious  generalization  to  allow  clauses  of  any  length.  The  preferred 
edges  are  chosen  as  the  Hrst  cross  edge  in  each  H-ladder  in  the  Aho,  Hopcroft,  and 
Ullman's  construction  for  any  variable  that  is  marked.  The  total  number  of  vertices  N  will 
then  satisfy  N  ^  2n+n^  ^  2n^,  where  n  is  the  number  of  variables  in  C^-VAR  SAT.  The 
total  number  of  preferred  edges  is  then  (n/2/^'®\  We  then  rescale  the  optimization 


constraint  and  query  requirement  from  (n/2)' 
vertices  in  the  graph. 


fNf 


where  N  is  the  total  number  of 


3.4  CONSTRAINED  OPTIMIZATIONS 

Constrained  optimizations  are  actually  two  NP  problems  11  ^  and  112  bound  together. 
The  first  problem  FIi  is  a  NP-Complete  set  problem  and  constrains  the  variables  in  the 
second  problem.  The  second  problem  112  is  an  NP-Complete  optimization  problem.  There 
is  generally  a  relation  between  IIi  and  112  limits  the  size  of  112  as  a  ^miction  of  the  size 
of  Ill. 


This  section  presents  one  constrained  optimization  problem  since  it  is  not  difficult  to 
convert  the  proofs  from  other  sections  into  constrained  optimization  proofs.  It  should  be 
noted  that  these  types  of  problems  are  not  particularly  interesting  as  realistic  optimization 
problems  that  might  be  encountered  in  practice.  They  are  presented  because  they  have  the 
best  sizing  ratios  of  all  of  the  restricted  problems  studied,  and  may  offer  the  best  hope  to 
extend  approximation  results  using  adversary  arguments  or  other  information  theoretic 
techniques.  The  problem  presented  in  this  section  is  called  CMAX  SAT^^^j.  A  formal 

definition  appears  below. 


31 


DEFINITION 


CMAX  Let  q)j  and  tpj  be  boolean  formulas  in  CNF  format 

where  |<P2l  ^  /(|<Pil).  Output  the  maximum  number  of  clauses  in  <p2  that 

can  be  simultaneously  satisfied  by  any  witness  ©  that  satisfies  all 
clauses  in 


CLAIM  3.4.1  CMAX  SAT„(i-e)  requires  log  n  queries  unless  P=NP. 


PROOF 


Reduce  CMAX  SAT„(i-e)  to  c2-LEX(i.E),og  „  as  follows.  Let  (p  be  an 
instance  of  C  -LEX^i.ejjog  3.  For  each  distinguished  variable  Xj  in 
{xi,,..,X(j.gjjgg  3}  of  <p  create  a  group  of  2*'*  singleton  clauses 
consisting  of  a  conjunction  of  i  copies  of  the  variable  Xj,  Let  92  be  this 
collection  of  clauses,  and  let  9j  be  9.  We  now  have 

(l-c)log  n  - 1 

l‘P2l  =  X  ^ 

i=0 


il-€ 


Since  I92I  ^  l9il  ,  I92I  -  /(l9il)  is  satisfied.O 


3.5  A-OPTIMIZATIONS 


A-optimizations  consist  of  two  NP  optimization  problems  11  j  and  112  112  is  an 

extension  of  Hi.  The  objective  is  to  compute  the  optimization  on  both  problems 
separately,  and  output  their  difference.  This  has  the  effect  of  determining  the  degree  to 
which  the  extension  was  able  to  affect  the  optimal  value.  As  with  constrained  optimizations 
there  is  generally  a  relation  between  111  and  112  ii^ts  the  size  of  112  a  function  of 
the  size  of  Hi- 
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Formally,  a  A-optimization  requires  that  Am  =  Hmj  -  m2\\  =  /(mj).  Where  mj  and  m2 
measure  the  problem  sizes.  If  the  relation  /  between  the  problem  sizes  is  polynomial  the 
degree  will  be  placed  as  an  exponent  on  the  A  in  the  problem  name.  For  example  A^-fl 
indicates  a  problem  with  a  quadratic  size  disparity. 

All  problems  in  this  section  are  reduced  to  a  problem  called  defined 

below. 

DEFINITION  3^.1  C^-MLEX^^^j ,  Let  (p  be  a  boolean  formula  in  CNF  format  where 
l<p|  <  n^  where  n  is  the  number  of  variables.  Output  the  LEX  max 
lowest  fin)  bits  of  any  witness  that  satisfies  c  clauses  where  c  is  the 
most  clauses  that  can  be  simultaneously  satisfied  by  any  witness. 

CLAIM  3J2  C^-MLEXjgg.j^  requires  logVn  - 1  queries  unless  P=NP. 

PROOF  Recall  that  C^-LEXj^g^^  requires  logVn  queries  unless  P=NP.  Create 

OTM  M  y  to  solve  C^-LEXj^g.^^  using  OTM  M2  that  solves  C^- 
MLEXicg.^^.  M j  will  simply  run  M2  and  make  one  additional  query  to 
see  of  9  is  satisfiable.  If  so  it  outputs  the  answer  from  M2  ,  if  not  it 
outputs  0  to  indicate  an  illegal  9.  Thus  M2  was  able  to  muster  logVn  - 1 
queries.O 

Query  bounds  on  C^-MLEXj^g^u  can  now  be  used  to  show  query  bounds  for  A*^^- 
MAX  SAT  defined  below. 

DEFINITION  5-5.5  A^^^-MAX  SAT,  Let  9j  and  92  be  boolean  formulas  in  CNF  format 
where  92  is  an  extension  of  9j  and,  I92I  -  l9il  =  ®‘'/|9j|.  Independently 
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compute  the  maximum  number  of  clauses  simultaneously  satisfiable  in 
<Pj  and  (P2>  and  output  the  difference. 

CLAIM  3J.4  A^'^-MAX  SAT  requires  logN|<Pj|  - 1  queries  unless  P=NP. 


PROOF 


Let  <p'  be  an  instance  of  C^-MLEXj^^.^^.  Let  9  be  9'  padded  with 
enough  variables  so  that  I9I  =  n^  exactly.  Let  9^  and  92  be  defined  as 

follows 


9,  =  (9)°.  92  =  (9)"(x,)'(Xj)^X3)"...(x^^/ 


logVn  -  1 


We  now  have  I92I  -  l9j|  =  Sl9j|,  and  the  problem  requires  log(^Vl9j|)-l 
queries.O 

The  methods  presented  above  can  also  be  applied  against  C^-LEX(i.g)iog  ^to  prove  A^'®- 
CLIQ,  A^'®-MIS,  A^^^-VC  require  log^VlVj  -  1  queries  unless  P=NP,  where  |V|  is  the 
number  of  vertices  in  the  graph. 
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4  APPROXIMATION  BOUNDS 


The  error  bound  proofs  of  Section  2  can  now  be  applied  to  the  query  bounds  of  Section 
3  to  get  absolute  and  relative  error  bounds.  Applying  the  theorems  directly  yields  strong 
error  bounds  for  1/2-approxiniations  on  many  restricted  NP  optimization  problems. 
Ideally,  one  would  like  to  be  able  to  extend  these  error  bounds  to  unrestricted  problems. 
Efforts  to  extend  these  bounds  under  this  CRP  have  met  with  mixed  results.  Strong  1/2- 
approximation  error  bounds  have  been  extended  for  several  problems,  however  closer 
inspection  revealed  that  these  bounds  could  be  attained  by  other  methods.  Efforts  to  extend 
k-approximation  bounds  to  problems  for  which  there  is  no  known  bound  (such  as  MAX 
CLIQUE,  and  MIS)  have  not  been  successful.  It  has,  however,  been  possible  to  extend 
^  -approximation  results  to  these  problems. 

The  remainder  of  this  section  is  arranged  as  follows.  First  1 /2-approximation  error 
bounds  are  presented  by  direct  use  of  the  theorems  of  Section  2.  An  example  is  then  given 
to  show  how  these  error  bounds  might  be  extended  to  unrestricted  problems.  The  section 
concludes  with  -approximation  results  extended  for  MAX  SAT. 

4.1  1/2-approximation  Results  for  Restricted  Optimizations 

CLAIM  4.1.1  All  bounded  optimization  problems  11  presented  in  Section  3.3,  have 
Approxji  >  3/2optjj  for  over-approximations  and  Approxj^  <  l/2optj| 

for  under-approximations  unless  P=NP. 
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PROOF  All  problems  in  Section  3.3  have  query  requirements  that  exactly  match 

the  maximum  number  of  bits  required  for  output.  Hie  results  then 
follow  from  Corollaries  2.4  and  2.5.0 


CLAIM  4.12  All  A-optimization  problems  IT  presented  in  Section  3.4,  have 
Approxj|>  5/4optj|  for  over-approximations  and  Approxj|<  3/4optjj  for 

under-approximations  unless  P=NP. 


PROOF  All  problems  in  Section  3.4  have  query  requirements  that  differ  from 

the  maximum  number  of  bits  required  for  output  by  an  additive  constant 
of  1.  The  results  then  follow  from  corollaries  2.4  and  2.5.0 


4.2  EXTENDING  k- APPROXIMATION  RESULTS  TO  UNRESTRICTED 

OPTIMIZATION  PROBLEMS 

This  section  presents  a  method  for  extending  the  error  bounds  from  bounded 
optimization  problems  to  unrestricted  versions.  1 /2-approximation  bounds  are  presented 
for  the  unrestricted  optimization  problems  KERNEL,  and  MINIMUM  EXACT 
COVERING.  The  1/2-approximation  results  on  many  bounded  optimization  problems  can 
be  extended  by  fairly  simple  constructions.  Unfortunately,  the  extensions  found  to  date 
can  also  be  proven  by  other  methods,  and  the  query  analysis  technique  has  not  been 
required.  Problems  for  which  there  are  no  known  error  bound  appear  to  be  more  difficult 
to  analyze  and  may  require  more  powerful  techniques  than  are  presented  here.  Formal 
definitions  of  the  KERNEL  and  MINIMUM  EXACT  COVERING  problems  appear  below. 


DEFINITION  4.2.1  Kernel  of  a  graph.  Let  G  =  (V,A)  be  a  directed  graph.  The  graph 
kernel  is  a  subset  of  vertices  V’of  V  such  that  for  any  vertex  ue  V-V’ 
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there  is  a  vertex  ve  V  such  that  the  directed  edge  (v,u)e  A.  Also,  for  all 
vertices  Vj.VjG  V,  the  edges  (vj.Vj)  or  (vj.Vj)  do  not  appear  in  A. 

DEFINITION  422  KERNEL,  Let  G  =  (V,A)  be  a  directed  graph.  Output  the  size  of  the 
largest  kernel  of  G. 

DEFINITION 42.3  Exact  set  covering.  Let  S  be  a  set,  and  let  C={Cj,C2,...,C„}  be  a 

set  of  subsets  over  S.  An  exact  covering  of  S  is  a  subset 
C’={C,,Cb,...,C2}  of  C  such  that  all  elements  of  S  appear  once  and 
only  once  in  some  subset  Cj  contained  in  C. 

DEFINITION  42.4  MINIMUM  EXACT  COVER  (MEC)  ,  Let  S  be  a  set, 
C={Cj,C2,...,C„}  be  a  set  of  subsets  over  S.  Output  the  size  of  the 

smallest  exact  covering  of  S. 

The  KERNEL,  and  MINIMUM  EXACT  COVER  problems  defined  above  have 
straightforward  bounded  versions  defined  below.  Both  of  these  problems  can  easily  be 
proven  to  have  no  1/2-approximations  using  the  methods  of  Section  2,  and  3. 

DEFINITION  42  J  BKERNEL^(,V|)«  Let  G  =  (V,A)  be  a  directed  graph,  and  V’  be  a  set 
of  distinguished  vertices  such  that  |V'|  =  /(1V|).  Output  the  largest  (or 
smallest)  number  of  distinguished  vertices  in  any  kernel  of  G. 

DEFINITION  4.2 .6  BOUNDED  OPTIMAL  EXACT  COVER^^^j  (BOEC^(„)) ,  Let  S  be 
a  set,  C={Cj,C2,...,Cj,}  be  a  set  of  subsets  over  S,  and  C  a  subset  of 
C  such  that  |C'l  =  /(|C|).  Output  the  largest  (or  smallest)  number  of 
distinguished  sets  in  C  in  any  exact  covering  of  S. 
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1/2-approxiination  bounds  can  now  be  proven  for  KERNEL  and  MEC  by  extending  the 
1/2-approxiniation  results  for  BKERNEL  and  BOEC.  The  proof  works  by  assuming  that 
there  exists  a  1 /2-approximation  algorithm  for  the  unrestricted  version,  and  then  using  this 
algorithm  to  build  a  polynomial-time  1/2-approximation  algorithm  for  the  bounded  version. 
Since  the  bounded  version  has  no  such  approximation  unless  P=NP,  then  the  unrestricted 
version  also  has  none  unless  P=NP.  The  formal  proof  for  BKERNEL  is  presented  below. 
A  similar  proof  can  be  made  for  BOEC,  and  many  other  bounded  optimization  problems. 

THEOREM  42.7  If  KERNEL  has  a  polynomial-time  1 /2-approximation,  then  so 
does  BKERNELy^lYU  for  any  polynomial  /. 


PROOF 


Let  G  be  an  instance  of  BKERNEL^^IYI^  Use  the  1 /2-approximation 

algorithm  for  KERNEL  to  build  a  polynomial-time  approximation 
algorithm  for  BKERNELy^jvjj  as  follows.  Let  /  be  of  the  form  Wn  for 

some  fixed  k.  Create  a  new  graph  G’  from  G  by  attaching  a  "plume”  of 
2 

2n*^  dummy  vertices  (where  n  is  the  number  of  vertices  in  G)  to  each 
distinguished  vertex  as  shown  in  figure  4.2.1  below. 
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We  now  use  the  algorithm  to  find  the  minimal  kernel  of  G'.  Notice  that 

2 

if  a  distinguished  vertex  in  G*  is  not  included  in  the  kernel,  then  n^  of 

the  dummy  vertices  in  the  plume  are  included.  Since  this  is  more  than 

the  total  number  of  vertices  in  G,  this  has  the  effect  of  minimizing  the 

number  of  distinguished  vertices  in  the  minimal  kernel  of  G'.  The  total 

number  of  vertices  y  iti  the  minimal  kernel  of  G'  will  then  be 
2_ 

p(n^  +  l)+q,  where  p  is  the  minimal  number  of  distinguished  vertices 
from  G,  and  q  is  the  total  number  of  non-distinguished  vertices  from 
G.  Let  ^  be  a  1 /2-approximation  of  y,  to  get  a  1 /2-approximation  of 

BKERNELy^lYjj  we  divide  ^  by  n^  +  1.  The  q  term  drops  out  and  we 
are  left  with  a  1/2-approximation  to  BKERNEL^^IYiyO 

4.3  EXTENDING  (l/ni')- APPROXIMATION  RESULTS  TO 
UNRESTRICTED  OPTIMIZATION  PROBLEMS 

The  k-approximation  results  of  Section  4.1  can  be  extended  to  yield  -^-approximation 

bounds  (for  some  rixed  k)  unless  P=NP.  These  results  are  weak  since  the  error  bound  is  a 
function  of  the  problem  size.  This  does,  however,  yield  some  useful  information  since 
there  are  several  known  algorithms  that  allow  approximations  of  this  form  [Me84].  The 
proof  follows  lines  similar  to  the  k-approximation  extensions  of  Section  4.2. 

The  -approximation  bounds  can  be  established  by  one  of  several  methods.  The  first 

n 

method  is  to  establish  query  requirements  for  the  unrestricted  problem  without  trying  to 
match  the  solution  bit  requirements.  Applying  Theorem  2.2  or  2.3  directly  can  then  yield 
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the  result  Using  this  method  it  is  possible  to  show 


— =-  approximation  results  for  the 


unrestricted  MAX  SAT  problem. 


The  second  method  is  to  develop  k-approximation  bounds  for  the  A-optimization  version 
of  the  problem  and  show  how  to  construct  an  approximation  algorithm  for  the  A- 
optimization  from  an  approximation  algorithm  for  the  unrestricted  optimization.  The  result 
is  a  proof  along  the  lines  of  that  presented  in  Hieorem  4.2.6.  An  example  of  this  second 
method  will  be  shown  below  using  the  query  requirements  for  A^^®-MAX  SAT  presented 
in  Theorem  3.4.2.  It  should  be  noted  that  these  weak  bounds  were  established  using  query 
requirements  for  C^'LEXiog.jn.  Stronger  results  can  be  established  when  query 
requirements  for  C^-LEX(i.£)iog  ^  are  used. 

THEOREM  43.1  MAX  SAT  has  no  polynomial-time  -  approximation  algorithm 


PROOF  Recall  that  Theorem  4.1.2  established  that  A*'®-MAX  SAT  has  no  1/4- 

approximation  unless  P=NP.  We  use  a  approximation  for 

unrestricted  MAX  SAT  to  construct  a  1 /4-approximation  for  A^^^-MAX 
SAT  as  follows.  Approximate  MAX  SAT  for  both  (pj  and  <P2 
independently.  Since  92  is  an  extension  of  9j  we  are  assured  that  the 
difference  in  these  two  values  will  be  within  l/4opt^i/6.j^j^  Since 
A^'®-MAX  SAT  has  no  1 /4-approximation  unless  P=NP,  we  conclude 
that  MAX  SAT  has  no  -  ap^noximation  .0 
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4.4  EXTENDING  e- APPROXIMATION  RESULTS  TO  UNRESTRICTED 

OPTIMIZATION  PROBLEMS 

Ideally,  one  would  like  to  extend  true  e-approximation  results  to  unrestricted  problems. 
Several  proofs  have  been  constructed  under  this  CRP  similar  to  those  for  extending  k- 
approximation  results  in  Section  4.2.  But  unfortunately,  all  proofs  constructed  to  date  can 
be  proven  by  other  means.  Problems  such  as  CLIQUE,  and  MAXIMUM 
INDEPENDENT  SET  have  no  known  approximation  bounds,  and  continue  to  resist  all 
but  -approximation  bounds  as  shown  in  Section  4.4. 

n 

It  may  be  possible  to  use  some  alternate  technique  such  as  adversary  arguments,  or 
some  other  information  theoretic  techniques  to  coaxe  true  k-approximate  or  e-approximate 
bounds  hpom  these  problems,  but  this  is  not  known. 
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5  CONCLUSION 


This  paper  presents  a  new  method  for  bounding  the  accuracy  attainable  by  a  polynomial¬ 
time  approximation  to  certain  classes  of  NP-Complete  optimization  problems. 
Y  -  approximation  bounds  are  presented  for  several  restricted  optimization  problems,  while 

-approximation  bounds  are  presented  for  several  unrestricted  problems.  The  method  is 

based  on  precisely  counting  the  number  of  queries  required  to  solve  the  problem  on  a 
oracle  machine,  and  comparing  it  to  the  maximum  number  of  bits  that  could  be  required  to 
represent  a  solution.  The  approximation  bounds  rely  on  an  underlying  asumption  that 
certain  complexity  classes  do  not  coincide,  such  as  P^NP,  NP^RP,  etc,  so  that  the 
approximation  bounds  hold  unless  the  assumption  does  not  hold.  The  "strength"  of  the 
approximation  results  depends  upon  the  strength  of  the  assumption  used  in  the 
construction. 

In  this  paper  all  proofs  are  based  on  the  assumption  that  IVNP.  Using  this  assumption 
several  restricted  NP  optimization  problems  in  graph  theory  and  combinatorics  are  proven 
to  have  no  polynomial- time  approximations  that  guarantee  accuracy  to  better  than  1/2  of 
optimal.  Due  to  the  nature  of  the  underlying  assumption  P^NP  the  optimizations  are 
generally  restricted  so  that  only  some  subset  of  the  instance  elements  may  participate  in  the 
optimization.  For  vertex  and  edge  problems  this  subset  was  restricted  to  be  at  most  0(Nn) 
of  the  total  vertices  in  the  graph.  For  the  coloring  problem  this  restraint  was  reduced  to 
0(Vn)  of  the  total  vertices  in  the  graph.  It  appears  that  these  constraints  can  be  further 
loosened  either  by  basing  the  proofs  on  a  less  stringent  assumption  such  as  NP^RP,  or  by 
finding  more  "efficient"  reductions. 
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Methods  are  also  shown  to  get  k-approximation  and  -approximation  results  for 

unrestricted  NP  optimization  problems.  Unfortunately,  all  k-approximation  results  found 
under  this  CRP  can  be  shown  by  other  means,  but  the  -^-approximation  results  can  be 

D 

shown  for  problems  that  had  no  previously  known  error  bounds.  These  results  are  weak 
since  the  error  bound  is  a  function  of  the  problem  size. 

It  may  be  possible  to  strengthen  the  error  results  for  these  unrestricted  problems  using 
adversary  arguments  or  information  theoretic  techniques.  This  would  be  a  significant 
result  Consider  the  Weighted  Max  Independent  Set  (WMIS)  problem.  It  is  currently 
known  that  the  unconstrained  optimization  version  of  this  problem  either  has  an  e- 
approximation,  or  has  no  k-approximation  at  all,  but  it  is  an  open  question  which  one 
holds.  Showing  that  uncontrained  WMIS  has  no  k-approximation  below  some  k  would 
establish  that  the  related  problem  MAX  MIS  has  no  e-approximation  at  all  and  hence  no  k- 
approximation.  This  would  be  a  very  strong  result  A  similar  new  result  may  be  possible 
for  the  unconstrained  CLIQUE  problem. 


44 


BIBLIOGRAPHY 


[Be88]  R.  Beigel.  NP-Hard  Sets  Are  P-Superterse  Unless  R=NP.  Technical 
Report  88-04,  The  Johns  Hopkins  University,  Dept,  of  Computer 
Science,  1988. 

[Ga8S]  W.  I.  Gasarch,  The  Complexity  of  Optimization  Functions.  Technical 
Report  TR-1652,  The  University  of  Maryland,  Dept,  of  Computer 
Science,  1985. 

[GJ79]  M.  R.  Garey  and  D.  S.  Johnson.  Computers  and  Intractability,  A  Guide  to 
the  Theory  of  NP-Completeness.  WH  Freeman  and  Co.  New  York, 
1979. 

[GP88]  W.  I.  Gasarch  and  S.  R.  Pearlman,  The  Complexity  of  Optimization 
Problems  Related  to  Partition.  Technical  Report,  The  University  of 
Maryland,  Dept,  of  Computer  Science,  1988. 

[Kr86]  M.  W.  Krentel.  The  Complexity  of  Optimization  Problems.  In  Proceedings 

of  the  18*^  annual  ACM  Symposium  on  Theory  of  Computing,  pages  69- 
76,  1986. 

[Me84]  K.  Melhom.  Graph  Algorithms  and  NP -Completeness  EATCS 
Monographs  in  Theoretical  Computer  Science,Springer  Verlag,  Heidelberg, 
1984. 


45 


Distribution  List  for  IDA  Paper  P-2347 
NAME  AND  ADDRESS  NUMBER  OF  COPIES 


\ 


Defense  Technical  Information  Center  2 

Cameron  Station 
Alexandria,  VA  22314 

Mr.  Karl  H.  Shingler  1 

Department  of  the  Air  Force 
So^are  Engineering  Institute 
Joint  Program  Office  (ESD) 

Carnegie  Mellon  University 
Pittsburgh,  PA  15213-3890 

IDA 


,  General  Larry  D.  Welch,  HQ  1 

Ms.  Ruth  L.  Greenstein,  HQ  1 

Mr.  Philip  L.  Major,  HQ  1 

Dr.  Robert  E.  Roberts,  HQ  1 

Mr.  Kevin  J.  Rappoport,  SRC  2 

Ms.  Anne  Douville,  CSED  1 

>  Dr.  Richard  J.  Ivanetich,  CSED  1 

Mr.  Terry  Mayfield,  CSED  1 

Ms.  Katydean  Price,  CSED  2 

Dr.  Richard  Wexelblat,  CSED  1 

IDA  Control  &  Distribution  Vault  2 


I 


I 


I 


I 


Distribution  List-1 


